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LARGE DEVIATIONS ASSOCIATED WITH POISSON DIRICHLET 
DISTRIBUTION AND EWENS 
SAMPLING FORMULA 

By Shui Feng ^ 

McMaster University 

Several results of large deviations are obtained for distributions 
that are associated with the Poisson-Dirichlet distribution and the 
Ewens sampling formula when the parameter 6 approaches infinity. 
The motivation for these results comes from a desire of understanding 
the exact meaning of 9 going to infinity. In terms of the law of large 
numbers and the central limit theorem, the limiting procedure of 6 
going to infinity in a Poisson-Dirichlet distribution corresponds to a 
finite allele model where the mutation rate per individual is fixed and 
the number of alleles going to infinity. We call this the finite allele 
approximation. The first main result of this article is concerned with 
the relation between this finite allele approximation and the Poisson- 
Dirichlet distribution in terms of large deviations. Large 6 can also 
be viewed as a limiting procedure of the effective population size 
going to infinity. In the second result a comparison is done between 
the sample size and the effective population size based on the Ewens 
sampling formula. 

1. Introduction. It is an effective tool to study the infinite dimensional 
model using their finite dimensional counterpart even though differences, 
sometimes essential, exist between the two. In this article we focus on a 
probability distribution, the Poisson-Dirichlet distribution with parameter 
6 > [henceforth denoted by PD{9)], on an infinite dimensional space, and 
its finite dimensional counterpart, the Dirichlet distribution. 

Let 

V= |(pi,p2,---) ■Pi>P2>-- - >0,Y^^pk = l|. 
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The infinitely-many-neutral-alleles model is a V-valued process describing 
the evolution of a population under random sampling and parent indepen- 
dent mutation. If the total mutation rate is 9, then the stationary distri- 
bution of this process is the PD{9). Kingman [15] introduced the PD{9) as 
an asymptotic distribution of the descending order statistics of a symmetric 
Dirichlet distribution with parameters K, a when K ^ oo and a — in a 
way such that limK~*ooKa = 9. We use F{9) = {Pi{9), P2{9), . . .) to denote 
the V-valued random variable with distribution PD{9). It is equal in law to 
the normalized jump sizes of Gamma process over the interval (0,0) ranked 
in descending order. A more friendly description of PD{9) is as follows. 

Consider an i.i.d. sequence of random variables Uk,k = 1,2,..., with com- 
mon distribution Beta{l,9). Set 

(1.1) Xi = C/i, Xn = {l-Ui)---{l-Un^i)Un, n>2. 

Then the law of the descending order statistics of Xi,X2, ... is PD{9). The 
representation in (1.1) is called the GEM representation after R. C. Griffiths, 
S. Engen and J. W. McCloskey. The sequence Xi^,k = 1,2, . . ., corresponds 
to the size-biased permutation of PD{9) and can be obtained through the 
size-biased sampling of a symmetric Dirichlet distribution with parameters 
K, a following the procedure of if ^ oo, a — > 0, and Ka 9. 
For any fixed n > 1 , let 

= |(ai, • • • ,an) :afc > 0, = 1,. . . ,n;^iai = n|. 

Consider a random sample of size n from a Poisson-Dirichlet population 
and for k = 1, . . . ,n, define 

(1.2) Ak = the number of alleles appearing in the sample exactly k times. 

Then A„ = {Ai, ...,A „) is an ^„-valued random variable with distribution 
given by the well-known Ewens sampling formula [7]: 




where 6l(„) = 9{9 + 1) ■ ■ ■ {9 + n - I). 

Consider instead a random sample of size n from a symmetric Dirichlet {a, 
. . . ,a) distribution with 9 = Ka, and let k = J27=i then 

P{A„, = (ai,...,a„)} 

(1.4) 

n! a'=r(i^ + i) -A-/ r(j + a) 1 
-9^^T{K-k + i) /iAr(j + i)r(a + 1) j 

When K goes to infinity, we end up with (1.3). 
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Hence, many properties of PD{9) can be derived from the corresponding 
results of the Dirichlet distribution with finite alleles. The approximation 
procedure used is to let K ^ oo,a ^ 0, and Ka —> 0. If a is held constant, 
then 6 will converge to infinity. This limiting procedure was first suggested by 
Watterson [22], and some results of the law of large numbers and fluctuations 
were obtained in Griffiths [11]. But this is not exactly the same as letting 
9 go to infinity in PD[9) since the latter is a two step limiting procedure: 
first Ka -^9 as — > oo, then 9 goes to infinity. 

In a large neutral population, the role of mutation is to bring in new type 
of alleles and to reduce the proportion of existing alleles. The parameter 
9 = 4Nu is the population mutation rate with u and N being the individual 
mutation rate and the effective population size, respectively. The limiting 
procedure of large 9 is equivalent to a situation where the mutation rate per 
individual is fixed and the effective population size is large. Motivated by the 
work of Gillespie [10] on the role of population size in molecular evolution, 
there have been renewed interests in the asymptotic behaviors of PD(9) for 
large 9 (see [3, 12, 13, 14]). 

The first topic in this article is the comparison in terms of large deviations 
between the finite allele Dirichlet { j^, ... ,-j;7) distribution and the infinite 
allele PD{9). Two types of limits are considered: the first one is — > oo 
followed by K ^ co; the second is 9 = K ^ oo. For the first type limit, the 
following diagram is commutative in terms of the law of large numbers: 



In terms of the law of large numbers, the diagram is still commutative when 
Dirichlet {-j^,. .. and PD{9) are replaced by the law of the size-biased 
sampling and the law of GEM, respectively. But we will show that it fails 
to be commutative in terms of large deviations. The second type limit cor- 
responds to the diagonal limit in the diagram. The results in [11] and [13] 
show that the central limit behavior of the finite allele Dirichlet distribution 
under the second type limit is the same as the infinite allele PD{9) for large 
9. We will show that the same relation holds for LDP. 

The second topic is the comparison between the population and a ran- 
dom sample based on the Ewens sampling formula. Consider as a certain 
population size. Then it would be interesting to see how large the sample 
size should be so that it behaves the same as the population for large 9. 

Here is an outline of the development of this article. In Section 2 the large 
deviation principles (LDP) are established for the GEM representation and 



(1.5) 




K large 



6 large 



il/K,...,l/K) 



(0,0,...)- 
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for the finite allele approximations. A detailed comparison is done between 
the LDPs for PD{9) and the GEM, and the corresponding LDPs for Dirichlet 
distribution and its descending order statistics. An entropy connection is 
established for the LDP result of PD{6) obtained in [3]. 

LDPs for the conditional and unconditional Ewens sampling formula are 
established in Section 3. The difference between the speed of the two LDPs 
indicates the strong effect of averaging and finite sample size. 

In Section 4 we consider the variable sample size and establish the relation 
between the sample size n, the parameter 9 and the LDP speeds for the 
total number of alleles, and for the age-class sizes. For the age-class sizes, 
the sample LDP will be the same as the population LDP if the sample size 
grows faster than 6. 

The reference [4] includes all the terminologies and standard techniques on 
large deviations used in this article. Since the state spaces encountered here 
are all compact, we do not make the distinction between a rate function and a 
good rate function. Generalizations to the two-parameter Poisson-Dirichlet 
distribution (cf. [17, 18, 19]) will be addressed in a separate article. 

2. LDP associated with PD{e). Let 

V = I (r,P2, ...).pi>P2>---> ^^flPk < l| 

be the closure of V equipped with the subspace topology of R°°. 

Let P = {Pi,P2, . . .) be distributed as PD{6). In [3] it was shown that an 
LDP holds for PD{9) with speed 6 and rate function 

{1 _ °° 

oo, else. 
As the first result of this section, we establish the LDP for the GEM. Let 

A = |(a;i,X2, . . .) > 0, A; = 1,2,. . . < l|, 

and X = (Xi,X2, . . .) be the GEM in (1.1) generated by the i.i.d. sequence 
of Beta{l,9) random variables Ui,U2,--- ■ Let Uf"^ denote the law of X 
on A. Since P and X stay in different spaces and the ordering operation is 
not continuous, the LDP for GEM does not follow easily from the LDP for 
PD{e). 

For each n > 1, set 

X„ = (^1, . . . , Xn), 

A„ = |(xi, . . .,x„) : > 0,fc = 1,2,. . . < l| 
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and 



'S'n(x) 



log^i ^ > xeA„;^Xfc<l, 

CX3, else. 



Lemma 2.1. For any n > 1, let H^^™ he the law of^n- Then the family 
{n^'^^ : > 0} satisfies a LDP on A„ with speed 9 and rate function Sn{-)- 

Proof. The LDP follows easily from Lemma 3.1 in [3], the indepen- 
dency of f/i, . . . , [/„ and the contraction principle. The rate function has the 
form 

inf< ^log—^^ — -^k = (1 -lii) • • • (1 -Uk^i)uk,k = l,...,n\, 
[i=i ^ J 

which is Sni') by direct calculation. □ 

Theorem 2.2. The family {Uf^ -.9 > 0} satisfies a LDP with speed 9 
and rate function 



(2-7) g(x) = r°g l-Er=i^. ' x = (xi,X2,...)GA,gx.<l, 
oo, else. 



Proof. First note that the topology on A can be generated by the 
following metric: 



d(x,y) = E 



oo I I 



k=l 



where x = (xi, X2, ■ . •), y = (yi, y2> • • •) ^ the space A is compact. 

Thus, to establish the LDP, it suffices [20] to verify that 

limlimsup^^^logn|'^™{(i(y,x) < 6} 

e-*oo 

(2.8) 

= limliminf^-^lognf'''{d(y,x) < 6} = -5(x). 
For each fixed 5i > 0, n > 1, and small enough (5 > 0, one has 
{d(y,x) < (5} c <^ sup {\yk - Xk\} < h\ , 

Ll<fc<n J 
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which imphes that 

Hmhmsup0~^logn|'^'^{(i(y,x) <6} 



< 



hmsupe* ^logUf^l sup {\yk-Xk\}<di 

e^oo ' il<k<n 



< - inf , 5'„(yi,...,y„). 

Since the very left-hand is independent of 5i and n, letting 5i go to zero 
and then n go to infinity, we get 

(2.9) limlimsup0^ilognf'"{d(y,x) <5} < -S{x). 

On the other hand, for each fixed 5 > 0, we can choose n large enough so 
that 

I sup {\yk-xk\} C{d{y,x) <5}, 

Ll<A;<n 

which leads to 

liminf0-ilognf'"{d(y,x) < 6} 

> lim inf e-^ log ( sup { - Xfc I } < 6/2 

> —Sn[xi, . . . , Xfi) 

> -5(x). 

By letting 5 approach zero, it follows that 

(2.10) limliminfe"Mognf'''{d(y,x) <5} > -S(x), 

which, combined with (2.9), implies the result. □ 

Remark. The rate function for GEM has the same form as the rate 
function for PD{9), which is expected because of its exchangeable form. 

For each fixed K >2, let (-P/^, . . . ,Px) be the decreasing order statistics 
of a Dirichlet{j^ , . . . , ■^) random vector. Then the law of {Pf' , ■ ■ ■ , P^ ) con- 
verges to PD{6) in the sense that, for every fixed r > 1, (-P/^, . . . , P^) con- 
verges to (Pi, . . . , Pr) when K approaches infinity. The LDP for (P/^, . . . , Pj^ ) 
when approaches infinity has been established in Theorem 2.1 of [2] with 
the rate function given by the relative entropy 



/^(pi,...,Pi.) = E^iog^. 



i=l 
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We now investigate the structure of diagram (1.5) in terms of the large 
deviation rate functions. 

Type I limit. 9 goes to infinity followed by K approaches infinity. 

Theorem 2.3. 

Sr{pi,...,Pr) 

= lim inf <^ (pi, ■ ■ ■ ,Pr, Qr+i, ■■■,qK)- 

il— >0O I 

r K ^ 

Pi > • • • > Pr > Qr+l >qK>0,^Pi+ X! 9i = 1 ^ 

i=l i=r+l ) 

Proof. The equality holds trivially if pi H hPr = 1- We now assume 

that Y7i=iPi < 1. It follows that 

I^{pi,...,Pr,qr+i,---,qK) 

V- 1 , 1 K-r^ 11 1 
= ^ log + —iF- IF + 1^ 



^ K Kpi K K K qr+i ■■■qK 

Since Ylf=r+i Qi = ^ ~ J2i=i Pi Qr+i ■ "Qk reaches the maximum when 
they are all equal, it follows that 

mf{I^ {pi, . . . ,Pr,qr+l, . . . ,qK) ■Pl>--->Pr> Qr+l >qK>0, 

Pl H \-pr + qr+l H hgi^ = 1} 

r, 1 1^, 1 K-r, K-r 



— log — I — log — I lo! 

K ^ K K ^ Pi K ° K 



K-r 1 
■log- 



K "1-ELiK' 
By letting K go to infinity, the equality follows. □ 

Remark. The LDP rate function for (Pi, . . . , P^.) under PD{9) has been 
shown to be S'r.(-) in [3]. Note that (q) is the relative entropy of (-^, . . . , -^) 
with respect to q. Thus, we are able to establish certain connections between 
relative entropy and the LDP for PD{9). This also makes a connection be- 
tween the LDP for Dirichlet{v) in Dawson and Feng [I, 2] and the LDP for 
PD{9). The LDP speeds for (Pi, ... , P,.) and (P/^, . . . , P^) are both 9. 
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Let Vi,...,Vk-i be independent random variables with Vi having a 
Beta{^ + 1, {K — distribution, and 

(2.11) Y^ = Vi, Yi'' = {l-Vi)---{l-Vi.i)Vu i = 2,...,K-l, 

be the size-biased samphng of the symmetric Dirichlet{j^ , . . . distribu- 
tion. 

Lemma 2.4. The family of the laws of (Y"/^, . . . , Y^_^) satisfies a LDP 
on Ak~i with speed 9 and rate function 



1, , K-i^ i-UZivi 

— log — 1 -— log 



K ° Vi K °i-ELiy« 

+ E 



1=1 

K-l r 



1=1 



1 , 1 K-i^ K-i 

— log 1 log 

K ^K+l-i K ^K+l- 



Proof. The LDP with speed 9 for Vi for each i = \, . . . ,K — 1 can 
be established by a direct application of the Laplace method and the rate 
function for Vi is 

lK,i(v) = ^ log - H — log ■ 



K ° V K ° 1 - u 

1 , 1 K-i^ K-i 
+ — log — : H —- log ■ 



K °K + l-i K ^K + l-i 

which implies the result by a combination of independency of 14, ... , Vk-i 
and the contraction principle. □ 

Theorem 2.5. The LDP rate function for iXf , ■ ■ • ,^r^) for each fixed r 
does not converge to the LDP rate function for (Xi, . . . , X^) as K approaches 
infinity. 

Proof. It follows from Lemma 2.1 that the LDP rate function for 
is Sr{xi, . . . ,Xr), which is finite as long as X)i=i < 1- By 
contraction principle, the LDP rate function for (1^/^, . . . jY^f^) is given by 

inf{S'^(xi, . . . ,Xr.,yr+i,- ■ -.yK-i) ■■ (xi, . . .,Xr,yr+l, ■ ..,yK~i) S Ak~i}, 
which is infinite when xi = for every K. □ 



Thus, under the Type I limit, the diagram (1.5) is commutative in terms of 
the LDP rate functions for the ordered distributions but not for the GEM. 
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Type II limit. 9 = K ^ oo. 

When 9 = K, the Dirichlet{-^ , . . . , ■^) distribution becomes the uniform 
distribution on the simplex {(xi, . . . , xk) Xi >0,i = 1, . . . ,K; J2iLi Xi = 1}. 
For each fixed r > 1, the density of {Pi , ■ ■ ■ ,P^) is given (cf. [21]) by 

(2.12) g{pi,p2, . . . ,Pr) = K{K-l)---{K-r + l)T{K)L{r, K; B), 

where 

L{r, K;B) = J . .. J dxr+i ■ ■ -dxK-i, 



B = <^{xr+i, . . .,XK-i)-0< Xi<pr,i = r + 1,...,K -1; 

K-l >| 

Xie[l-ar-pr,l-ar]\ 



i=r+l 

and Ur = J2^j=iPj- If I — ar <Pr, then 

B = i^{xr+i, . . .,xk^i):0< Xi<l-pr,i = r + l,...,K-l; 

K-l >| 

^ Xi e [0, 1 - ar] 

i=r+l J 



and 



L{r,K;B)= J... J x^j^ 



(2.13) 



[0,l-ar]®-^-'-2 
(l_a^)X-r-l 

T{K-r) 



To deal with the case of 1 — a,- > let X^+i, ■ ■ ■ , Xk-i be i.i.d. uniform 
random variables over (0,Pr)- Then 



K-r-l 



L{r,K;B)=p^~''^^P<l-ar-pr< ^r+i < 1 - Or 



(2.14) 



{K-r-l 
i=l 



■K-r-l 



P\ Y ^^+« <^-Cir-Pr 



i=l 
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Let Zr+i, ■ ■ ■ , Zk-i be i.i.d. uniform random variables over (0,1 — a^). 
Since 1 — > Pr, the conditional distribution of Zi given Zi < pr is the 
same as the law of Xi. Hence, by direct calculation, we get 



'K-r-\ 

^ " a,. 

{K-r-\ \ 
ZrM<^-ar\ 

_ (l-g^)^-"-^ 
V{K-r) 

=i:{^~\~^y^--r-p.)'pr'~'~' 

(2.15) 

(K-r~\ 



X 



l<i<K-r-\ 

^ X,+,<l-a, , 

i=m+l J 

where 

m = inf{A; > 1 : kp^ > 1 — a^.} 

and 

(2.16) C{K,T,m) = Yy j(l-a,-p,)Vr'- 

This combined with (2.13) implies that 

C[K + m,r,m)L [K + m — r) ' ^ 



< (l-«r.) 



i=l 
K-r-l 



T{K-r) 
Similarly, we can prove that 

(K-r-l ~) (1 _ _ \K-r-l 

(2.18) pf-'-lp Y Xr+^<l-ar-pr\<^ 



1=1 



T{K-r) 
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Taking into account (2.14) we get 



T{K-r) 

[1 - a^)^'-''-! (1 - a,)™r(ir - r) 



> 



(2.19) 



T{K — r) C{K + m,r,m)T{K + m — r) 
1 — C{K + m, r, m) 

K-r-l 



1 - a,. - pA^-""^ T{K + m-r) 



1-ar J (1 -ar)™r(K-r) 



Theorem 2.6. The family of the laws of {P{^ , . . . ,P^) for each fixed 
r satisfies a LDP with speed 6 and rate function Sr{pi, ■ ■ ■ ,Pr) as 6 = K 
approaches infinity. 

Proof. Let 

Vr = I (gi , . . . , gr ) : < < • ■ • < gi , ^ < 1 1 • 

For each 6>0 and {pi,...,pr) in V^, let B = B{{pi, . . . ,pr),6) and B = 
B{{pi, . . . ,pr),6) denote, respectively, the open and closed balls in cen- 
tered at (pi, . . . ,Pr) with radius 5. Consider the case of J2k=iPk < l,Pr > 
first. By choosing 5 small, we can have J2k=i Qi <^,Qr > for all {qi, . . . , qr) 
in B. It then follows from (2.19) that for each (gi, . . . ,qr) in B 

(2.20) lim ^log5(gi,...,g,.) = -log- ^ 



Choose {q\,...,ql) in B such that 

ql = m.i{qr:{qu...,qr)^B} 

and 

^qi = sup< ^qk:{qi,...,ql)^B\. 
k=i lfc=i J 

The existence of such point follows from the continuity of the corresponding 
functions in the above definition. 

Since the density function g{qi, . . . ,qr) is increasing in qr for fixed qi, . . . , 
qr-i, and decreasing in J2k=i9k for fixed qr, we get that 

hmliminf ilogP{(Pf , . . . ) G B} 
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= limliminf ^log / g{qi, . . . ,qr)dqi ■ ■ ■ dqr 

(2.21) > limliminf ^log / g{qi, . . . ,qr-i,qr) dqi ■ ■ ■ dqr 

5^0 e^oo a Jb 



- l^^n I / diQi, ■■■,qr)dqi---dq., 




r 



= 1™ log -2L^%]= -Sr{pi, ■ ■ ■ ,Pr 

where (2.20) is used to get the second equality. 
On the other hand, from (2.19) we have 



g{qi,...,qr)<K{K-l)---{K-r + l)- 
Thus, 

lim lim sup I \ogP{{Pf, ...,P,^)e B} 

< lim ^logii:(i^-l)---(i^-r + l) ^^^^ 



(2.22) 



ooO ^ ^ ' ^ 'T{K-r) 

+ lim lim ^ log /_ (1 - gi Qr)^'^'^ dgi • • ■ dg,. 

(5— iOe— >oo (7 Jb 



= _log-i — . 

L — Qr 

Since the state space is compact, partial LPD holds. Prom (2.21) and 
(2.22), all partial rate functions are the same and equal to Sr{pi, ■ ■ ■ ,Pr) 
on the set {(pi, . . . ,pr) S ^rT,l=iPk < '^,Pr > 0}. If J2l=iPk < 1 and there 
exists kf) <r such that pk > for k < kQ — 1 and = for k > ko, then, 
for any partial rate function /', we have I'{pi, . . . ,pr) < Sr{pi, . . . ,Pr) due 
to the continuity of Sr{pi, . . . ,Pr) and the lower semi-continuity of /'. Thus, 
the lower bound still holds. If X]fc=iPfc = I> the lower bound is trivial. 

The upper bound holds true in all cases due to the monotonicity of the 
function (1 - ELi %)^"''"^ in ELi*- Thus, (2.21) and (2.22) hold in all 
cases. This combined with the compactness of the state space implies the 
theorem. □ 



For each fixed r > 1 and the size-biased permutation defined in (2.11), we 
have the following: 

Theorem 2.7. The family of the laws of {Y^ , . . . , Y^) satisfies a LDP 
with speed 9 and rate function Sr{yi, ■ ■ ■ ,yr) as 9 = K approaches infinity. 
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Proof. Noting that Vi, . . . ,Vr are independent and Vi has a Beta{2, 6 — 
i) distribution, it follows that the law of {Vi, . . . ,Vr) satisfies a LDP with 
speed 6 and rate function 



^log- 



as 9 = K approaches infinity. The theorem then follows easily from the con- 
traction principle. □ 

Remark. From Theorems 2.6 and 2.7, we conclude that the LDPs for 
the finite allele model under the Type II limit are the same for the infinite 
allele model for large 9. A similar result is expected to hold under the general 
limit of lim6i_>oo = O 0. 

3. LDP for Ewens sampling formula. For each fixed n > 1, let A„ be 

the random partition defined in (1.2). For a given allele proportion p = 
{pi,P2, ■ ■ ■) ii'^'V , and (ai, . . . , a„,) in An, the conditional sampling probability 
Fa„(p) = P{A„ = a„ = {ai,...,an)\P{9) = p} is given by (cf. Kingman [16]) 

(3.23) Fa„(p) = C{n, a„) ' ' ■Pha.pLpL ' ' ■Pha.Pli ' ' ' ' 

where 



C(n,a„) 



n! 



and the summation is over distinct 

If we extend the function Fa,^ (p) directly to V, then we have the following: 



Lemma 3.1. The function -Fa„(p) is continuous on V if and only if 
ai =0. 

Proof. Let k = J27=i '^i ^"^^ ai = r. If r = k, then k = n. For each m > 1, 

let 

Pm 




,m m 

m 



which converges to (0, . . .) as m goes to infinity. By direct calculation, 



n J \m J n 
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Next we assume that 1 < r < fc. For any m>r\/{k — r), let 

k—r 



_/ 1 11 1 \ 

~ U(fc - r) ' • • • ' 2(fc - r) \2;^' • • • ' W ' • • 7 ^ ' 

m 

which converges to q = ( 2{k-r) ' • • • ' 2(k-r) ' ^' • • •) goes to infinity. Write 
-Pk„(p) =C(n,a„) 



E+E 

L 1 2 J 



PhiPh2 • • -PhaApL ■ ■ •^La^'L ■■■■ 



where J2i is over indexes such that {lij : i = 2, . . . , n; j = 1, . . . , ai} = {1, . . . , 
k — r}. 

By direct calculation, we get 



a2,...,anj \r J \2{k-r)J \2m J \m 

where the o(^) follows from the fact that each nonzero term in involves 
higher orders of 1/m. 

Thus, Fa„(p) is not continuous for ai > 1. Next we assume that ai = 0. 
Set N = max{/2a2 > • ■ • ) ^n.a„ }• Then for each M > 1, 

i^a„(p)=FijP)+^a„(P), 

where 

(P) = C{n, a„) ^ pI^pI^ ■ ■ ■ pI^/,^^ • • • , 

N<M 

FL (p) = C{n, a„) 5] p2 ^ • • • pI^^pI^ 

N>M 

Clearly, F^^[p) is continuous in p. Let Tl denote the collection of partitions 
of n — 1 obtained from a„ by removing one individual from the sample. 
Then 

Thus, for any p,q in V, 
|F|a„(p) - Fa„(q)| < iF^Jp) - F^Jq)] + |Ff Jp) - F^Jq)! 



<l^i (p)-i^i (q)l + 2^^^^lE€W}77^-^ 

-I anVi-; a„VM/i 1^ 'C{n-1, 
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which imphes the continuity of -Fa,i (p) and the lemma. □ 

Now Theorem 4.4 in [3] combined with the contraction principle leads to 
the following: 

Theorem 3.2. For each a„ in An with ai = 0, the family of the laws of 
-^a„(p) under PD{0) satisfies an LDP with speed 6 and rate function 

(3.24) (x) = inf j log : Fa„ (p) = x 1 . 

Remark. If PD{9) is replaced by the finite allele symmetric Dirichlet 
distribution, then the law of Fa„(p) will satisfy the LDP for all a„. The 
function F^^ (p) can also be extended continuously to V by replacing X^i^i Pi 
with constant 1. By using this extension, the law of Fa„(p) satisfies a LDP 
for all a„. More detailed discussions on this extension are found in [6]. 

Next we turn to the large deviations associated with the Ewens sampling 
formula. The state space is now An and the random element is A„. 

Theorem 3.3. The family of the laws of An under PD{9) satisfies an 
LDP with speed log 9 and rate function 

n 

(3.25) /esf(a) =n-^aj. 

i=l 

Proof. From the Ewens sampling formula, we have 

1 1 f n' 6*^111 "0 

log P{ A„ = a} = — — <^ log ™ , + log ■ 



loge " ^ " ' \og9\ "n"=ij"^%! o^n) y 

The theorem follows by letting 9 go to infinity. □ 

For the i^-allele symmetric Dirichlet{-^, . . . , -^) distribution, the sampling 
formula has the form 

(3.26) ^i^--^^-Yl^^^a,\{K-T;^=^a.V.e^n)l\\ m9/I<) J ' 



Theorem 3.4. Assume that 9 grows with K and 

9 

hm ^ = ce (0,+ooJ. 

Then the family of the law of A„ under Dirichlet {j^ j^) satisfies an 
LDP with speed logK and rate function /csf(a)- 
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Proof. For c < oo, 
1 



logK 



logP{A„ = a} 



1 



log 



ni 



+ log- 



K\ 



+ log 



^(„)(K-Er=i«.)! 

" / r(i + g/K) y/ 
/i^l T{e/K) J 



— > — /csf(a) as K goes to infinity. 
For = 00, by Stirling's formula, we get 

^ logP{A„ = a} 



logiiT 



1 



logi^ 



log 



+ log 



+ log 



J_YTfni + e/K)_y^]\ 



logK 



log 



-4sf(a) 



A^(K-i)-.-(K-Er=i«.+i) 

+ log 

as K goes to infinity. 



+ 



1 



log A' 



□ 



Remark. An interesting feature of this theorem is the fact that the 
large deviation speed and rate function do not depend on the exact speed 
of 9 as long as it grows no slower than K. 

4. Scaling limit with varying sample size. From the previous section, 
we see that the LDPs for the conditional Ewens sampling formula and the 
unconditional Ewens sampling formula have different speeds. This is due to 
the averaging and the finite sample size which reduce the randomness. It is 
thus natural to consider the case of varying sample size. 

There are four possible relations between the sample size n and parameter 
6, namely: 

Case A: n fixed and 6 approaches infinity. 
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Case B: n grows with 9 and \\m.Q^ooG /n = oo. 
Case C: n grows with 6 and \\m.Q-^ao9 /n = c > 0. 
Case D: n grows with 6 and Ume_,oo ^/^^ = 0. 
Define 

logo, 



a{9) 



nlog 

e, 

9 log 



Case A, 
Case B, 
Case C, 
Case D, 



and 



PiO) 



riog^, 


Case 


A 


log-, 


Case 


B 


n 








Case 


C 


n 






U, 


Case 


D 



Let 



Kn{9) = J2Ai 



i=l 



be the total number of distinct alleles in a random sample of size n. Let \S^\ 
denote the coefficient of 9^ in Then the distribution of Kn{9) is given 
by (cf. page 114 in [8]) 



(4.27) 



ak 

P{Kn{9) = k} = \St\ — . 



The moment generating function of Kn{9) is calculated as 

tK^,_{e'0)in) _T{9e' + n)m 



(4.28) 



M{t) = E[e' 



T{9et)T{9 + n) 
For large 9, we obtain the following result. 



Theorem 4.1. 



(4.29) A(t)= lim -±- log M{P{9)t) 



where, 
(4.30) 



Ai(t) 



nt, 

(t+l) 



Ai(t), 


Case A 




Case B 




Case C 


A4(t), 


Case D 


-1, 





n, 



else, 
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(4.31) Mt) = [\ 



t, ift>-l, 
else. 



(4.32) As{t) = i{[clogc - (1 + c) log(l + c)] 

+ [(1 + ce^*) log(l + ce^*) - ce"* logce^*]} 

(4.33) A4(t)=e*-1. 

Proof. Case A and Case C follow easily from direct calculations. For 
Case B, we use the Stirling formula to get 

1 



aie) 



log Mimt) 



1 L /l + (^/n)*+i\ ^ / 1 V^/") 



log 9/n 



t+i 



l + 9/n j+^°^l'+(W+^ 

1 \ (9/nr 



log( 1+ . . 
i9/n) 



-0(1) 



t, ift>-l, 
— 1, else. 



For Case D, using Stirling's formula again, we get 

1 i„„ _ tlog(l + n/6le-*) log(l + n/6') 



■ log M{p{e)t) = e* ^\ ;^ — ^- 7 +o(i) 



a(0) logn/^ logn/0 

^ e* - 1. □ 

Theorem 4.2. /n Case A, t/ie family of the laws of Kn{0) on space 
{1, . . . ,n} under PD{9) satisfies an LDP with speed log9 and rate function 

I{k) =n — k. 

Proof. For each A; = 1, . . . , n, it follows from (4.27) that 

1 1 9^ 

lim - — - log P{Kn{9) = k]= lim - — - log — 

e^oologt' e^oologC^ 

= -{n-k), 

which implies the result. □ 

Theorem 4.3. In Case B, the family of the laws of Kn{9)/n on space 
[0, 1] under PD{9) satisfies an LDP with speed nlog^ and rate function 

I{x) = 1 — X. 
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Proof. As the coefficient of 9^ in the number Ues between 



{f^tJi and 



(fc-i)! • fohows from Stirhng's formula that 



log 



: n 



n 



log 1 + - +- log log- 



n 



n 



n 



log 1 + 



1 \e/n 



e/n 



+ o(l] 



-a{e) 



n 



+ 0(1) 



and 



log 



n — 1 
k-l 



k-1, k-l 



■log- 



n — 1 77,-1 



= -(77-1) 

= -a{6)o{l). 
Thus, it follows from (4.27) that 



+ 1 



k-l 

77 — 1 



n — k , ' 
log -+0 1 

77 — 1 



1 



a{9) 



logP 



Kn{0) ^ k 
n 



n 



l--+o{l)]. 



For each x in [0, 1] and 5 > 0, the total number of integers in 
{1, . . . , 77} n [77X — 77J, 77X + 77J] 
is of the magnitude of n5 which is a{6)o{l). Hence, 



lim lim inf — -— log P 

5^0 9^00 a{6) 

1 



KniO) 



77 



lim lim sup — — log P 

S^o e^oo a(^) 



<<5 



Kni9) 



n 



<6 



= -{l-x), 

which combined with the compactness of [0, 1] implies the result. □ 

Theorem 4.4. In Case C, the family of the laws of Kn{9)/n on space 
[0, 1] under PD{9) satisfies an LDP with speed 6 and rate function 

I{x) = sup{tx - A^it)}. 
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Proof. Extend the law of Kn{0) jn to the whole real line K and denote 
the extension by Pg.n- 

According to Definition 2.3.5 in [4], the function A3(t) is essentially smooth. 
By the Gartner-Ellis Theorem, a LDP holds for the family {-Pe,n : ^ > 0, n = 
1, . . .} with rate function 

i{x) = sup{ix - A3(t)}. 

t&R 

To prove the theorem, it suffices to show that I{x) = oo for x ^ [0,1]. 
Since 

lim K^{t) = -[clogc - (1 + c) log(l + c)], 

by letting t approach negative infinity, one gets that I{x) = oo for x < 0. By 
direct calculation, 

(1 + ce^*) log(l + ce"^) - ce"^ log(ce"*) 
= log(l + ce^*) + (ce^*)logfl + ^ 



~ ct as t approaches positive infinity. 
Hence, by choosing t > large, we get that I{x) = oo for x > 1. □ 

Since A4{t) is also essentially smooth, by an argument similar to that used 
in the proof of Theorem 4.4, we get the following: 

Theorem 4.5. In Case D, the family of the laws of Kn{0)/ 9 log ^ on 
space [0,+oo) under PD(9) satisfies an LDP with speed ^log^ and rate 
function 

I{x) = xlogx — X + 1. 

As a by-product of these LDPs, we get the following weak laws of large 
numbers. 

Corollary 4.1. 

(4.34) Case A, limi^„(^)=n, 

6*— >oo 

(4.35) Case B, lim = 1, 

e^oo n 

(4.36) Case C, lim = log + - 

0— >oo n \ c 

(4.37) Case D, lim = 1. 

e~*oo 9\og{n/6) 
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Proof. In Case A, the rate function has a unique zero point at n. Thus, 
(4.34) holds. Similarly, results (4.35) and (4.37) follow from the fact that the 
corresponding rate functions have unique zero point at 1. To prove (4.37), 
rewrite A3(t) as 

As{t) = -jF{ce^')-F{c)], 

where F(x) = (1 + rr) log(l + x) — x\og{x). Then, by a change of variable of 
u = ce*^* , it follows that 



(4.38) h{x) = - 

c 



supjrrlogu — F{u)} — (xlogc — F{c)) 
.u>0 



For each fixed x G (0, 1), the supremum in (4.38) is achieved at a unique 
satisfying 

The cases of x = or 1 correspond to or n — > oo, respectively. Thus, 
/3(x) > unless = c or, equivalently, for x = log(l + ^T- This leads to 
(4.36). □ 

Remark. It follows from (4.37) that the LDP in Case D is very similar 
to the case studied in [9] where 9 is fixed and n approaches infinity. 

The parameter 9 is proportional to certain effective population size when 
the individual mutation rate is held constant. The limiting procedure of 9 
approaching infinity is thus equivalent to letting the population size go to 
infinity. This next result compares sample size n with the effective population 
size through the study of the age-class size. It reveals how large the sample 
size should be so that the sample and the effective population will behave 
the same as 9 approaches infinity. 

Let Xi^n, ■ ■ ■ ,Xn,n be the age-class sizes in the sample. Then from Don- 
nelly and Tavare [5], one has 

^( fc ) n[n — k)\ (c^ + n— Ij! 



Theorem 4.6. In Case A, the family of the laws of Xi^n on space 
{1, . . . ,n} satisfies a LDP with speed \og9 and rate function k — 1. 



22 



S. FENG 



Proof. A direct calculation gives 
1 



(4.40) 



6»->oo log 6 



\ogP{Xi^n = k} 



l-Vlim^°g(^ + "-^^=-(fc-l). 
1=1 ° 



□ 



Theorem 4.7. T/ie family of the laws of on space E = [0, 1] satis- 
fies a LDP with speed ^{6) and rate function I{x) given respectively by 



(4.41) 



and 



(4.42) 



where 



(4.43) 



7(^) 



n\og-, 
n 



I(x) 



X, 

Ic{x), 

log 



1 



1 — X 



Case B, 

Case C, 
Case D, 



Case B, 
Case C, 

Case D, 



/,(x) = -[(c+l)log(c + l) 



+ (1 — x) log(l — x) — (c + 1 — x) log(c +1 — x)]. 



Remark 1. The last case is the same as the LDP for the GEM. In other 
words, when n grows faster than 0, the random sample behaves the same as 
the population. 

Proof of Theorem 4.7. It follows from Stirling's formula that 
logP{Xi,„ = A;} 

■ (n + l)"+i/2(n + e- ky+n-k-i/2- 



(4.44) 



where 



+ log- + o 1 

n 

h + h + h + h + h + h + o(l), 



h =log-, 
n 
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log 



h 
h 
h 

h = n\og 



n+l 



n 



+ log - + 1 

n 



A:-- log 1 + , 



n n 



1 



n+l 



n + 1 — /c ' 
+ n) log ( 1 - 



For each x in E, let \nx\ denote the integer part of nx. It is not hard 
to see that in Case B Ii + 12 + 1^ + lb + = 0(7(0)) uniformly in k/n and 
lii^^^) h{0) converges to —x uniformly in x. In Case C, Ji +/2 = o{j{9)) 



uniformly in k/n and 7^[l3 + ^4 + ^5 + leli^^) —Ic{x) uniformly in x. 
Thus, we have 

(4.45) 



lim , ^ 
6-007(6') 



\ogP{X^,n=[nx\} = -I{x). 



For each x \n E and > 0, choose n large enough so that \nx\ is in the 
interval {nx — n5, nx + n6) . Then 



(4.46) 



P{Xi,n = lnx\ } < P{\Xi^n/n -x\<S} 
<P{\Xi,n/n-x\<5} 

\k/n—x\<5 



< 2{n5 + 1) max P{Xi^n = k}. 

kG[nx—n5,nx+n5] ' 



By the uniform convergence, and (4.46), we get 
1 



(4.47) 



lim lim 

5^09^00 7(0) 



logP{\Xi^n/n-x\ < 6} 



■ lim lim 

(5— 9— 00 7(6') 



logP{\Xi,n/n- x\ <6} = -lix). 



The maneuvering in Case D is a little different. By a reorganization of 
the terms, we have 



(4.48) 



log = k} = Ji + J2 + J3 + J4 + J5 + 0(1), 
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Jl = log 



J2 = 
J3 = 
J4 = 

J5 = 

Noting that 



log 



/clog 
nlog 
9 log 



- logn, 
/ 1 + 1/n 
VI + 1/n - ^/n 
1 + 1/n — k/n 
l + e/n-k/n 

{n+l){0 + n-k) 
{n+l-k){e + ny 

9/n + l-k/n 
9/n + l 



+ log 



1 + e/n 



1 + 9/n - k/n 



(4.49) 



n(5 min P{Xi^n = k} 

kS:[nx—n5,nx+nS] 

<P{\Xi^n/n-x\<5} 

< 2{n5 + 1) max P{Xi^n = k}, 

kS[nx—nS,nx+n5] 



by taking the logarithm, the term — logn in Ji is cancelled by log(n5) and 
the term log^ clearly grows slower than 7(0). 

First consider the case < x < 1, and choose 6 small enough so that 
x + 6<l. 

It is clear that J2 = o{6), and | J5 converges uniformly to log(l — y) over 
[x — 6,x -\- 6] as — > 00, k/n^ y. For J3 and J4, we have 



1 



k 



log 1 + 



kn . , , 
-;^log( 1 + 

y 



{i-e)/n 

l + e/n-k/n 
{l-e)/n 



1-y 



l + 6/n-k/n, 
as 6* — > cxD, fc/n — > y 



and 



:J4 



log 1 + 



n 



77- 

7:log( 1 + 



{e-i)k 



n2 + n6' + (l-A;)(n + 6') 

(0- l)fc/n2 
1 + (9/n + (1/n - A;/n)(l + 6'/n) 

as 6 ^ 00, k/n y. 
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Summing-up all the terms, one gets 

lim lim - logP{\Xi^n/n — x\ < 6} 

(4.50) =lim lim ]-logP{\Xi^n/n - x\ <6} 

= -log-^. 

1 — X 

Finally, for x = 1, the result still holds from the fact that P{Xi^n = k} is 
decreasing in k and 

max P{Xi,„ = fc}<P{Xi,„= K1-5)J}. 

k(l:[nx—n5,nx+nS] I — I 

This result can be generalized to the first r family sizes in a sample of 
size n. From Donnelly and Tavare [5], one has, for ki >l,i = 1, . . . ,r, 



(4.51) 



{e/nf 



(1 — ki/n) •••(! — ki/n — ■ ■ ■ — kr-i/n) 

n! {9 + n-ki fe^ - 1)! 

^ {n-ki A;^)! (e + n-1)! ' 

This formula is very similar to (4.39) and all asymptotic calculations are 

almost the same except one needs to replace k there with ki-\ l-k,.. The 

state space is 

|(A;i,. . .,kr):ki>l,i = l,...,r;Y^^f^j <n^ 



in Case A and 



{xi,...,Xr):xi G [0,l],i = l,...,r;^Xj <1 



Table 1 



Case Speed Rate function 

A loge m=,h-r 

B nlog£ 

c e ^c(E:=i^O 
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in all other cases. The corresponding LDP results are summarized in Table 
1. 
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